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The Bioloyical Services Program was established within the U.S. Fish 
and Wildlife Service to supply scientific information and methodologies on 
key environmental issues that impact fish and wildlife resources and their 
Supporting ecosystems. The mission of the program is as follows: 


e To strengthen the Fish and Wildlife Service in its role as 
a primary source of information on national fish and wild- 
life resources, particularly in respect to environmental 
impact assessment, 


@ To gather, analyze, and present information that wil] aid 
decisionmakers in the identification and resolution of 
problems associated with major changes in land and water 
use. 


@ To provide better ecological information and evaluation 
for Department of the Interior development programs, such 
as those relating to energy development. 


Information developed by the Biologica! Services Program is intended 
for use in the planning and decisionmaking ocess to prevent or minimize 
the impact of development on fish ana wildiire. Research activities and 
technical assistance services are based on an analysis of the issues,a 
determination of the decisionmakers involved and their information needs, 
and an evaluation of the state of the art to identify information qaps 
and to determine priorities. This is a strategy that will ensure that 
the products produced and disseminated are timely and useful. 


Projects have been initiated in the following areas: coal extraction 
and conversion; power plants; geothermal, mineral and oil] shale develop- 
ment; water resource analysis, including stream alterations and western 
water allocation; coastal ecosystems and Outer Continental Shelf develop- 
ment; and systems inventory, including National Wetland Inventory, 
habitat classification and analysis, and information transfer. 


The Biological Services Program consists of the Office of Biological 
Services in Washington, D.C., which is responsible for overall] planning and 
management; National Teams, which provide the Program's central scientific 
and technical expertise and arrange for contracting biological services 
studies with states, universities, consulting firms, and others; Regional 
Staff, who provide a link to problems at the operating level; and staff at 
certain Fish and Wildlife Service research facilities, who conduct inhouse 
research studies. 
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SUMMARY 


This report discusses the development of a coastal and marine bird 
data base at the Migratory Bird and Habitat Research Laboratory. The system 
is compared with other data bases, and suggestions for future development, 
such as possible adaptations for other taxonomic groups, are included. 


The data base is based on the Statistical Analysis System but includes 
extensions programmed in PL/I. The Appendix shows how the system evolved. 


Output examples are given for heron data and pelagic bird data which 
indicate the types of analyses that can be conducted and output figures. 


The Appendixes include a retrieval language user's guide and 
description of the retrieval process and listing of translator program. 
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INTRODUCTION 


Information about our wildlife resource expands daily. Scientific publi- 
Cations often allude to surveys of wildlife with data in theses or in-house 
reports. Data on groups of organisms are therefore very difficult to collate 
and reorganize for work beyond the objectives of the original survey. 


Current interest in offshore oil drilliag, concern about oil spills, 
and coastal development have focused on the need for a coastal and marine 
bird data base. Surveys by Federal, State, and private organizations 
are not readily available to writers of environmental impact statements. 
The data are also often in such diverse formats that comparisons between 
different locations or different surveys are impossible. For example, the 
literature includes a great deal of information from individuals or groups 
of people making observations of marine birds from ships. Unfortunately, 
time of day or specific location of many observations is not recorded. 

In other instances, we find that surveys are made of shorebirds along a 
particular coastline or of marine birds from ships, but the data are simply 
filed in storage cabinets. 


The diversity in format, the lack of a central data storage system, and 
the difficulty of knowing what information is available cause duplicated 
effort and wasted time. Often those who prepare environmental impact state- 
ments do not survey as much data as necessary and consequently end up with 
inadequate assessments. 


DATA NEEDS 


We have mentioned the need to write coastal environmental impact state- 
ments. A comprehensive coastal and marine bird data base is an essential 
prerequisite in preparing such assessments. These data must be available 
quickly in a format that is easy to use. Data are needed on bird distribu- 
tion, abundance, and major habitat types used during the breeding, migratory, 
and wintering periods. Information should be included on habitat use, 
activity patterns, and effects of changing weather patterns. 


Data storage and retrieval systems must be established so additional 
data gathered each year can be incorporated and summarized. This infor- 
mation could be used by migratory bird managers to determine population 
trends and distribution of coastal and marine birds. Instances involving 
special habitats could be identified for migratory bird specialists. 


Data storage facilities should contain information that research biolo- 
gists can use in determining habitat requirements, nesting areas, and in some 
circumstances, productivity of coastal and marine birds. Data storage 


facilities should not, however, be viewed as a means of conducting research 
without making field observations. A data base on coastal and marine birds 
can provide a foundation for subsequent work by research biologists. 


A central data base would provide conformity in format and help to 
Standardize survey methods. In addition, it would make available to assess- 
ment groups and decision makers those variables which appear most important. 


The U.S. Fish and Wildlife Service, as the nation's major wildlife 
conservation organization, maintains other large data bases, including 
records of the Bird Banding Laboratory, the Dove and Woodcock Survev, 
the Winter Waterfowl Survey, and the Breeding Bird Survey. The U.S. 

Fish and Wildlife Service, other Federal agencies, private organizations, 
States and foreign governments, particularly Canada and Mexico, might 
use the data available in the coastal and marine bird data base. 


This report includes discussion of the development of a coastal and 
marine bird data base at the Migratory Bird and Habitat Research Laboratory, 
types of data that can be accommodated by the system, and examples of 
potential output. This system is compared with other data bases and 
Suggestions are made for future development including possible use for 
other species. The general philosophy of data systems is reviewed in 
the Appendix and a user's manual and the system's description are given. 


RELATIONSHIP AND INTERFACES WITH OTHER SYSTEMS 


A number of data storage systems containing species information have 
been developed by State and Federal agencies. Fxisting endangered 
species information systems, including the Sensitive Wildlife Information 
System {SWIS) developed by the Army Corps of Engineers and a proposed 
U.S. Fish and Wildlife System, are described (Christman and Campbell, 
undated). These systems are usually prepared for various groups, such 
aS management personnel, environmental assessment teams, and policy 
makers, for whom detailed scientific infcrmation is not ordinarily 
required. Our system would complement existing systems by providing the 
information necessary for their preparation in the future. 


There are several other specialized data retrieval systems which 
function very well for their intended purposes but which do not have the 
flexibility to accommodate diverse data needs. Examples are the system 
used by the Alaskan Seabird Survey Group and National Audubon Society's 
Colonial Bird Register at Cornell University. These systems are very 
efficient in processing a large amount of homogeneous data, but they 
Cannot easily accommodate new types of information. They are also 
expensive to modify because any change requires redefining the records 
and altering the processing programs. 


In preparing our system, we have contacted managers of other coastal 
and marine bird data storage systems and have developed, where possible, a 
compatible system. Thus,data from the Alaskan Seabird Survey and the 
Colonial Bird Register can be easily merged with data from our system. 











SYSTEM DESCRIPTION 


After considering system features discussed in Appendix A, we decided 
to use SAS (Barr et al. 1979) as the data base management system, and to 
write a preprocessor in PL/I to translate a retrieval request into the SAS 
language. This approach allows users to have a widely available and well- 
Supported data base management and statistical analysis system and at the 
Same time have customized features. The user has access to a wide variety 
of data management functions and statistical analysis procedures at an 
affordab!e development cost. After the required data are retrieved, the 
user can manipulate the programming language to accommodate the data, 
access the report writing features to produce tables, graphs, maps, and 
figures, or use the statistical procedures. 


Data input can be easily adapted to the form of the source documents. 
Due to data independence, there is no relationship between the input and 
Storage formats. The user can customize the input data format to each of 
the diverse sources of data, often avoiding costly recoding of data into a 
common format. SAS has the flexibility to adapt to virtually any format 
that contains all the required formation. 


We worked with two sets of data in order to develop and implement 
Storage and retrieval of data in the system. The first data set was 
collected in a survey of East Coast heron colonies conducted in 1975 and 
1976 (Custer and Osborn 1977). Variables collected include information 
on the location (latitude, longitude, State, ecoregion, wetland type, and 
status [new, stable, abandoned])of each colony, and the number of individuals 
of each nesting species. 


The second set of data consists of information on seabirds observed 
on pelagic cruises by personnel of the Manomet Bird Observatory in Massa- 
chusetts. The field sheet used on these cruises is shown in Figure l. 


The data are stored in four flat files (tables), logically arranged 
as a three-level hierarchy. The MAIN file, at the first level of the 
hierarchy, contains data on location, weather, sea conditions, and other 
factors that pertain to all species sighted during the observation. 


Instead of reserving space for al! possible species in the MAIN file 
at the first level of the hierarchy, data on each species sighted (or each 
age and sex category of each species sighted) are stored in a record in 
the SPECIES file at the second level of the hierarchy. Space is not reserved 
for species which were not sighted. The SPECIES file contains variables 
such as the number of individuals that are recorded at each sighting. 
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Figure 1. Field sheet for recording observations of pelagic birds. 
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Figure 1 (Concluded) 
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In addition to the variables that are routinely recorded on each 
sighting, a large number of other variables are occasionally recorded. 
Data are often included from diverse special studies which have additional 
variables. This system does not reject nonstandard data although standard- 
ization should be encouraged. To be useful when future needs cannot always 
be predicted, the data base must not be limited in the amount or variety 
of information that can be stored and retrieved. The need for such an 
Open-ended approach is especially important with ecological data because of 
their inherent complexity (Farrell et al. 1979). 


Nonstandard variables are stored in the flat files NUMVAR (for numeric 
variables) and CHARVAR (for character variables) at the third level of the 
hierarchy. The name of the variable and its value are stored so that new 
variables can be included by simply adding a new variable name to the files 
(Geissler 1975). This storage procedure saves much space. This approach 
also allows the system to store a virtually unlimited variety of data with- 
Out wasting space. Of course, some control must be exercised to prevent 
two different variables being given the same name. 





TYPES OF DATA 


Because data on coastal and marine birds are difficult to collect, 
there is an added need for a central base of information. In many 
instances, breeding colonies and nesting habitats are in remote locations 
where access is difficult. Shortage of manpower and funds also complicates 
attempts to census breeding populations. 


Furthermore, few of the species of concern actually breed in the 


United States. For example, the Wilson's Storm-Petrel (Oceanites oceanicus), 





commonly observed in waters off the East Coast during the summer months, 


breeds in Antarctica and on islands in the South Atlantic. The Black-legged 


Kittiwake (Rissa tridactyla) breeds only as far south as the Gulf of St. 
Lawrence on the East Coast, but winters in U.S. waters. A number of shore- 
bird species, common on beaches, mudflats, and estuaries during migration, 
nest in tundra areas of Canada. It is essential to gather information on 
these species which only winter in U.S. waters and, also, those which breed 
in this country because all will be affected by developments and activities 
in coastal areas and at sea. 





Though little long-term population information is available (Drury 
1974), most breeding colonies located along the coasts have been censused. 
Personnel of the U.S. Fish and Wildlife Service conducted a survey of 
colonies of wading birds and seabirds along the Atlantic Coast in 1977, 
and portions of the Pacific Coast in 1979 (in press). Some States have 
developed programs to survey colonies within their borders. Canadian 
Wildlife Service Biologists have collected information on breeding 
distribution and censused colonies since 1967, as part of their "Studies 
on Northern Seabirds" program (Brown et al. 1975; Nettleship 1976). 
Observers are always requested to describe physical features and general 
habitat and numbers of individuals of each species. The data from many of 
these surveys are stored in the Colonial Bird Register at the Laboratory 
of Ornithology, Cornell University. 


The U.S. Fish and Wildlife Service, Manomet (Massachusetts) Bird 
Observatory, and the Canadian Wildlife Service are involved in gathering 
data on the pelagic distribution and use of water habitats by seabirds. 
Numerous observations of seabirds are also reported in the literature 
(Thom 1956; Wiley 1959; Rowlett 1978). Many of these observations do not 
include specific locations; however, some may be useful for supplementing 
data from organized pelagic cruises. These types of data are needed 
particularly when the recent problems of oi] spills and the surge of 
interest in offshore oil exploration, are considered. 


Other studies focus on particular species of birds (Tuck and Squires 
1955; Rees 1961) or on one aspect of the ecology of a number of species 


(Pearson 1968; Bradstreet 1976). Such information also could be useful to 
prospective users of a coastal and marine bird data base. 





OUTPUT EXAMPLES 


Potential users of a coastal and marine bird data base have been men- 
tioned in earlier sections of this report. The following examples show some 
of the analyses possible using data stored in the system and the retrieval 
programs used to accomplish them. Note: The retrieval program simply 
accesses the desired set of data. The tables, maps, and statistical analyses 
shown are the results of SAS programs run on the data retrieved. For a 
description of the retrieval language, see the User's Manual in Appendix B. 


LOCATION OF BREEDING COLONIES AND NESTING SITES 


All assessments of environmental impact on coastal ecosystems should 
consider the location of colonies of breeding birds. The effects of pollutants 
or the disturbances of a development on these colonies should be evaluated. 
Colony and nest site locations can be retrieved, by geographic region, 
from the coastal and marine bird data base. 


In the example below, the Retrieval Program creates a SAS data set 
named SC/76, containing observations of heron colonies from South Carolina 
in 1976. Only the variables COLNAME (colony name), LAT (latitude), LONG 
(longitude), and TOTAL (total number of birds) are included. The user 
Specifies the variables of interest in the VARIABLES statement; the desired 
observations are selected in the IF statement. 


The data set is then passed to a SAS program which produces the list 
of colonies shown in Table 1. 


RETRIEVAL PROGRAM: 





DATA SC/6; 
VARIABLES COLNAME LAT LONG TOTAL; 








IF YEAR = 76 AND STATE = ‘SC' THEN KEEP = ‘YES'; 





The colony location of certain species of birds can be retrieved as 
Shown below. 


RETRIEVAL PROGRAM: 





DATA SC/6; 
VARIABLES COLNAME LAT LONG LBHE LOHE; 








IF YEAR = 76 AND STATE = 'SC' THEN KEEP ='YES'; 














Table 1. Heron colonies of coastal South Carolina, 1976. 


COLONY 


DRUM ISLAND 
PUMPKINSEED ISLAND 
COMBAHEE RIVER 
WHOOPING CRANE POND 
BROAD CREEK FARM 
WHITE BANKS 

DEVEAUX BANK 
DAUFUSKIE ISLAND 
COLLETON RIVER 
MAGNOLIA GARDENS 
SANTEE SWAMP 
ARCADIA PLANTATION 
ST. PHILLIPS ISLAND 
WHITE ISLAND 

BIRD ISLAND 
COLLETON NECK 
KINLOCH PLANTATION 
CAPERS ISLAND 
MAYRANTS RESERVE 


LATITUDE 


3280. 7 
3328.3 
3270.8 
3223.3 
3220.8 
3301.6 
3254.7 
3213.8 
3228.2 
3287.2 
3314.7 
3339.0 
3229.2 
3299.1 
3296.5 
3229.4 
3321.0 
3287.3 
3298.5 


LONGITUDE 


7992.5 
7920.8 
8083.3 
8072.0 
8071.5 
7951.2 
8017.7 
8084.2 
8081.3 
8008.8 
1938.2 
7921.2 
8058.8 
1954.6 
1956.3 
8079.5 
1933.3 
1966.7 
7971.8 


TOTAL BIRDS 


10980 
7470 
3000 
1506 

948 
918 
780 
522 
430 
384 
270 
200 
190 
190 
180 
128 

34 

16 
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The data set created by the Retrieval Program again contains observa- 
tions from South Carolina in 1976. However, in this example, the bird 
species of interest (Little Blue «1d Louisiana Herons) are also specified; 
the species’ names appear as variables in the data set, with the number of 
individuals of each species as the value of that variable. The numbers of 
Little Blue Herons (Florida caerulea) and Louisiana Herons (Hydranassa 
tricolor) nesting in colonies in coastal South Carolina are i in Table 2. 








DISTRIBUTION MAPS 


Although breeding and wintering ranges have been fairly well 
established for many species of terrestrial birds, very little is 
known about the habits and distribution of pelagic birds, particularly 
during the winter. While knowledge of the distribution of seabirds is of 
general interest to birders and biologists, it is essential to impact 
assessment tcams because seabirds are most vulnerable to oil spills. 
The computer-generated maps in Figures 2a-2d and 3a-3d show seasonal 
distributions of two species of seabirds in waters off the East Coast of 
the United States during 1978. Observational data are plotted in 10-minute 
blocks. Densities are expressed by classes based on the total number of 
birds of each species observed in that block and the number of times each 
block was sampled. The notation ‘-' indicates that a block was checked, 
but no birds of that species were observed. The Black-legged Kittiwake 
breeds in coastal areas and on islands in Canada and the Arctic (Fig. 2c). 
Thus, observations of this species are almost nonexistent during the 
summer, while densities and range are greatest during the winter months. 
In contrast, the Wilson's Storm-Petrel (Fig. 3) breeds in the Southern 
Hemisphere. No observations were reported in U.S. waters during 
the winter; however, the species is very common during the summer 
months. An overlay of the extent of Georges Bank off the New England 
coast (Fig. 4) shows Greater Shearwater (Puffinus gravis) use of 
this rich fishery and demonstrates the use of distribution maps for 
determining seabird densities at proposed offshore development sites. 





*RETRIEVAL PROGRAM: 
DATA; 
VARIABLES MONTH LATD LATM LONGD LONGM AREA BLKI WISP; 











KEEP = '‘YES'; 





*In this example, all observations of Black-legged Kittiwakes and 
Wilson's Storm-Petrels are desired; hence, an IF statement is not needed 
to subset the data. "KEEP=YES" implies that al] observations are to be 
included. The retrieval data set will include the variables required 
by the SAS program to produce the maps. 


DENSITY TABLES 


Development of offshore oil sites can greatly affect pelagic birds; 
thus, areas where these species concentrate should be identified. Density 
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Table 2. Nesting colonies of the Little Blue Heron 
and Louisiana Heron in coastal South Carolina, 1976. 


COLONY 


DRUM ISLAND 
PUMPKINSEED ISLAND 
COMBAHEE RIVER 
WHOOPING CRANE POND 
BROAD CREEK FARM 
WHITE BANKS 
DEVEAUX BANK 
DAUFUSKIE ISLAND 
COLLETON RIVER 
MAGNOLIA GARDENS 
ARCADIA PLANTATION 
BIRD ISLAND 
COLLETON NECK 


LATITUDE 


3280.7 
3328.3 
3270.8 
3223.3 
3220.8 
3301.6 
3254.7 
3213.8 
3228.2 
3287.2 
3339.0 
3296.5 
3229.4 


LONGITUDE 


7992.5 
7920.8 
8083.3 
8072.0 
8071.5 
7951.2 
8017.7 
8084.2 
8081.3 
8008.8 
7921.2 
7956.3 
8079.5 
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LBHE 


600 
120 
350 





LAT 
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Figure 2a. Seasonal distribution of Black-legged Kittiwakes, Winter 1978. 
(Plot symbols as follows: '-'=density 0; '1'=0<density<=3; '2'=3< density< =10; 
'3'=10< density< =30; '4'=30< density< =100; '5'=density > 100.) 
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1 pe 2b. Seasonal distribution of wath oy Kittiwakes, Spring 1978. 
(Plot symbols as follows: '-'=density 0; '1'=0<density<=3; '2'=3< density< =10; 
'3'=10< density< =30; '4'=30< density<=100; '5'=density > 100.) 
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ce Summer 


Figure 2c. Seasonal distribution of Black-legged Kittiwakes, Summer 1978, 
(Plot symbols as follows: '-'=density 0; '1'=0< density< =3; '2'=3< density< =10; 
'3'=10< density< =30; '4'=30< density< =100; '5'=density>100. ) 
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Figure 2d. Seasonal distribution of ete or Kittiwakes, Fall 1978. 
(Plot symbols as follows: '-'=density 0; '1'=0< density<=3; '2'=3< density<=10; 
'3'=10< density< =30; '4'=30< density<=100; '5'=density > 100.) 


15 








LAT 








45 ++ + + + + 
44 ++ + + + + 
43 ++ + + + + 
42 ++ + + + + 
; KK XXKXKKK- --- ----- -- - 
; x Kk K KXXXx -- -- - - 
: REX K- OK 2 2 =e =e - - 
: KKXKKE -- *x -~ - --- 
41 ++ + + xkxKXX «xx+xk --- + --- - #¢+ - -- + + + 
: K KKKEXEK &X - ---- - ~ 
: KxxX xxx -- - --- 
H K ----4«kx -- - - 
: x--- - - - 
: -* o- 0ee 
40 ++ +* --- + t- -- ¢-- --- + + + + + + 
: xx x -- - 
: x x -- - 
: x --- - 
: xx x 
: “KK -x -- --- 
39 ++ x +x- + + + + + + + + + + 7 
: x x 
: K-- 
: x 
; * ¥ 
38 + + “+ + + + + + + + + + + + 
: x xx 
: xX 
; -x 
; «x 
: * 
37 4-4----- +----- +----- +----- +----- +----- +----- +-- + + +----- +----- +- 
76 75 74 73 72 71 70 69 68 67 66 65 64 
a. Winter LONG 


Figure 3a. Seasonal distribution of Wilson's Storm-Petrels, Winter 1978. 
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Seasonal distribution of Wilson's Storm-Petrels, Spring 1978. 
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Figure 3c. Seasonal distribution of Wilson's Storm-Petrels, Summer 1978. 
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Figure 3d. Seasonal distribution of Wilson's Storm-Petrels, Fall 1978 
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Density of Greater Shearwaters on Georges Bank, Fall 1978, 
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tables can be produced, from data retrieved, to assist in evaluating such 
regions. The relative density of Black-legged Kittiwakes observed in 1978 

per 10-minute block of latitude and longitude are shown (Table 3) for the 
four~degree block that includes Georges Bank. Missing values occur in blocks 
that were not checked. The program needed to retrieve the desired data set is 
Shown below: 


RETRIEVAL PROGRAM: 
DATA; 
VARIABLES LATD LATM LONGD LONGM AREA SPECIES NUMBER; 











IF SPECIES = ‘BLKI* THEN KEEP = ‘YES'; 





STATISTICAL ANALYSES 


It is useful to determine the relationship between presence and 
abundance of the species, and associated habitat characteristics, in 
order to understand the distribution and habitat use of bird species. 
Several statistical procedures can be used to look at univariate or 
multivariate relations between species numbers and environmental 
variables. A portion of the computer output of a stepwise regression 
is shown for 1978 observations of the Black-legged Kittiwake (Table 4). 
The C. criterion (Neter and Wasserman 1974) was used to select step 4 
as the "best" model. In summary, wind speed, month, latitude, and 
longitude are the most important variables in this set of observations 
explaining the abundance of Black-legged Kittiwakes. Be cautious in 
interpreting these results. The analysis would be more meaningful 
if a larger number of observations were included; it is presented 
here only as an example of a multivariate statistical procedure that 
can be performed on retrieved data. Such models can also be used to 
a limited extent to predict abundance, but they are most useful in 
exploratory analyses of the relationships between the independent 
and dependent variables. 


*RETRIEVAL PROGRAM: 





DATA BLKI; 
VARIABLES LATD LATM LONGD LONGM DEPTH TEMP PRESSURE 
WINDSPD WINDDIR SEA BLKI; 











KEEP = ‘YES'; 





*Description of variables given in User's Manual, Appendix B. 
Other statistical procedures available to users of the data base are 
listed in Table 5. 
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Table 3. Density of Black-legged Kittiwakes in Georges Bank, 1978. Latitude and longitude expressed as 3 digits; 
e.g., 695 represents the 10 minutes between 69° 50' and 70° 0'. 
Longitude 

Latitude JU ge HU J: 054 J: ‘ é BU D/é é p 
435 ° ° ° ° ° ° ° ° ° ° , , ° ° ° 6.45 . ° ° 
434 6 e @ 6 o @ ® @ 5.38 oe e 1.34 e eS e 4.30 e e e 
433 ° ° ° ° ° . , , , ° , - 20.83. , 6.45 . ° ‘ 
432 ° , ° ° - 14.78 2.99 4.03. ° ° ° 5.38 14.34 . 1.08 . , ° 
431 1.54 . 8.06 1.90 3.49 4.18 . 7.53 8.06 . ° ° ° ° , 5.38. ° , 
430 33.96 2.39 1.19 . ° ‘ , ‘ ° ° ° ‘ , 2.15 . 2.15 . ° , 
425 ° ° ° ° ° ‘ ° , ‘ , , ° , ‘ 7.17 10.75 . ‘ , 
424 ° , ° - 21.51. 2.24 . ° ° ‘ ‘ 119 . ° , ° ° , 
423 ° , - 20.16 . 10.75 6.45 1.39. 2.93 . ° 0.98 . ‘ , ‘ ‘ 
422 67.20 . 2.15 . 108 . ‘ 2.15 4.61 6.27 . 1.08 2.15 26.39 1.08 . 1.19 . 
421 23.99 5.17 20.83 2.02 3.23 . 13.98 . , 1.96 . 2.15 5.02 . - 12.71 ~~ . - 2.51 
420 ° 5.38 7.63 . ° ° 3.44 1.08 . ‘ ° ° 1.34 . 1.08 1.08 . 9.10 
415 ° ° ° , ° ° ‘ 3.23 6.99 1.34 . ° ‘ ‘ 2.39 2.50 . - 1.79 
414 - 18.43 1.34 . 7.17 9.68 . ° ° 3.23 1.61 2.87 1.08 . 1.34 . 5.38 
413 - 10.75 1.79 3.16 7.77 8.96 8.60 0.72 4.53 4.18 . ° , , , ‘ 143 . 3.91 
412 ° ‘ ° ° 2.29 . 2.28 . 5.38 5.38 4.30. ‘ , 0.72 . ° - 8.06 
411 ° ° - 18.43 1.54 3.36 0.69 1.24 . 1.08 . ‘ 0.72 0.72 . ‘ 16.13 
410 ° ‘ ° ° 5.38 2.24 2.26 0.7/2 0.72 1.43 . , ° ‘ ° ° - 5.87 
405 ° , 1.08 2.39 8.27 2.15 3.58 . 2.15 1.08 . 0.77 4.30 . , ‘ ° 1.96 
404 ° ° 2.15 1.08 1.08 . ‘ ° ° ‘ ‘ ° ° ° 4.30 . 20.53 1.96 
403 0.98 . ° 1.08 . ‘ , ° ° 179 . ° - 10.08. 0.72 . 3.58 . 
402 0.98 . ° ° ° ‘ ‘ ° ° 119 . ° ° ‘ ° 0.72 . ° ‘ 
401 ° ° ° 2.15 . 2.15 . ° ° ° 1.08 . , , ° ° ° ° ° 
400 ° 2.69 . , ° ‘ ‘ ‘ ‘ ° ° , ‘ ‘ ‘ ° ° ° ‘ 











Table 4. Stepwise regression for Black-legged Kittiwake. Number of 
individuals is the dependent variable; environmental measures are the 
independent variables. Step 5 was chosen as the "best" model using 
the Cy criterion. 


MAXIMUM R-SQUARE IMPROVEMENT FOR DEPENDENT VARIABLE BLKI 


STEP 4 VARIABLE WINDSPD ENTERED R SQUARE = 0.01369218 CCP) = 11.48524000 
DF SUM OF SQUARES MEAN SQUARE F PROB>F 
REGRESSION 4 7585 .44172547 1896 . 36043137 17.89 0.0001 
ERROR 5154 546412.72070911 106.01721395 
TOTAL 5158 553998. 16243458 
B VALUE STD ERROR TYPE II SS F PROB>F 
INTERCEPT ~36 .84351994 
TEMP -0. 12696995 0 .02585743 2556 .27430100 24.11 0.0001 
WINDSPD 0.06908211 0.021462111 1102.61473459 10.40 0.0013 
LAT 0.46984051 0.1196 1637 1635.67133665 15.43 0.0001 
LONG 0.27982 167 0.06777048 1807.41517922 17.05 0.0001 
THE ABOVE MODEL IS THE BSEST 4 VARIABLE MODEL FOUND. 
STEP 5 VARIABLE DEPTH ENTERED R SQUARE = 0.01469015 CCP) = 8 .26374200 
DF SUM OF SQUARES MEAN SQUARE F PROB>F 
REGRESSION 5 8138.31471993 1627 .66294399 15.37 0.0001 
ERROR 5153 545859 .8477 1465 105.93049635 
TOTAL 5158 553998. 16243458 
B VALUE STD ERROR TYPE II SS F PROB>F 
INTERCEPT ~42.98972178 
DEPTH 0.00065336 0.00028599 552 .87299446 5.22 0.0224 
TEMP -0. 14305737 0 .02678894 3020 .85992427 28.52 0.0001 
WINDSPD 0.07104589 0.02142959 1164.31700013 10.99 0.0009 
LAT 0.55093635 0. 12472545 2066 .87255949 19.51 0.9001 
LONG 0.32076879 0.07007375 2219.70087702 20.95 0.0001 
THE ABOVE MODEL IS THE BEST 5 VARIABLE MODEL FOUND. 
STEP 6 VARI#” PRESSURE ENTERED R SQUARE = 0.01502956 CCP) = 8.48791026 
DF SUM OF SQUARES MEAN SQUARE F PROB>F 
REGRESSION 6 8326 . 34685402 1387 .72447567 13.10 0.0001 
ERROR 5152 545671.81558056 105.91456048 
TOTAL 5158 553998. 16243458 
B VALUE STD ERROR TYPE II SS F PROB>F 
INTERCEPT -70.21156180 
DEPTH 0.00067086 0.00028627 581.66642987 5.49 0.0191 
TEMP 0. 15483344 0.02820731 3191.25783774 30.13 0.0001 
PRESSURE 0.02582141 0.01937946 188.032 13408 1.78 0.1828 
WINDSPD 0.07594348 0.02174096 1292.34754759 12.20 0.0005 
LAT 0 .55832494 0. 12483929 2118 .49360859 20.90 0.0001 
LONG 0.33123782 0.07050764 2337 .56176301 22.07 0.0001 
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SAS 

~ ANOVA 
AUTOREG 
CANCORR 
CHART 
CLUSTER 
CORR 
DISCRIM 
DUNCAN 
EDITOR 
FACTOR 
FREQ 
FUNCAT 
GLM 


GUTTMAN 
MATRIX 
MEANS 
MERGE 
NEIGHBOR 
NESTED 
NLIN 
NPAR 1WAY 
PLAN 
PLOT 
PROBIT 
RANK 
RSQUARE 
SCORE 
SORT 
SPECTRA 
STANDARD 
STEPWISE 
SUMMARY 
SYSREG 


TTEST 


UNIVARIATE 


UPDATE 
VARCOMP 


Table 5. 


Statistical procedures and data manage- 


ment capabilities available in SAS and BMDP. 


analysis of variance for balanced data 

autoregression 

canonical correlation 

histograms, pie charts, star charts 

cluster analysis 

correlation analysis 

discriminant analysis 

Duncan's multiple range test 

interactive and batch editing of data sets 

factor analysis 

frequency and crosstabulation tables 

categorical analysis 

simple and multiple regression, analysis of variance for 
unbalanced data, analysis of covariance, response surface 
models, weighted regression, polynomial regression, partial 
correlations, multivariate analysis of variance 

Guttman scaling 

matrix language 

means and other descriptive statistics 

merges data sets to add variables to observations 
nearest neighbor discriminant analysis 

nested analysis of variance 

noniinear regression : 

One-way analysis of variance on ranks 

randomized plans for experiments 

printer plotting 

probit analysis 

ranking 

all possible regressions 

linear combinations of coefficients and data values 
sorts data sets by variables specified 

Spectral analysis 

standardization 

stepwise regression 

summary statistics 

ordinary least squares, two-stage least squares, limited 
information maximum likelihood, three-stage least squares, 
seemingly unrelated regressions 

t-tests 

univariate descriptive statistics, including percentiles 
updates a master file 

variance component analysis 


Plus report-writing capabilities and a built-in programning language 
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Table 5 (Concluded) 


univariate statistics 

univariate statistics, including frequencies 
t-tests 

single column frequencies--numeric and nonnumeric 
histograms and univariate plots 

bivariate (scatter) plots 

description of groups (strata) with histograms and 
analysis of variance 

missing value correlation 

multiway description of groups 

two-way frequency tables--measures of association 
two-way frequency tables--empty cells and departures 
from independence 

multiway frequency tables--the log-linear model 

life tables and survival functions 

cluster analysis of variables 

cluster analysis of cases 

block clustering 

factor analysis 

canonical correlation analysis 

Stepwise discriminant analysis 

description and estimation of missing data 

multiple linear regression 

stepwise regression 

nonlinear regression 

regression on principal components 

polynomial regresssion 

partial correlation and multivariate regression 

all possible subsets regression 

derivative-free nonlinear regression 

multipass transformation 

nonparametric statistics 

one-way analysis of variance and covariance 
analysis of variance and covariance, including repeated 
measures 

general mixed model analysis of variance 
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FUTURE DEVELOPMENT 


A data base management system has been developed for coastal 
and marine bird data. However, with any system there is a continuing 
need for maintenance. This includes quality control and entry of 
new data, maintaining backup copies, answering user inquiries, and 
making small changes required by modifications in the computer system 
or new versions of SAS. In addition to maintenance, the data base 
must have the capacity to evolve, meeting the changing needs of the 
users. Features such as the preprocessor may be modified to handle 
these requests as new demands are placed on the system. 


This coastal and marine bird data base has been developed to 
cover data needs in the foreseeable future. Retrieval systems can 
be modified to meet local specific needs or expanded to meet national 
needs. Users might suggest modification or more efficient ways of 
using the system. 


It is likely that some investigations will require diverse data 
from different data bases to study broad interrelationships. Information 
Specialists should investigate methods which assure the smooth transfer 
of data between data bases. 


Decisions need to be made concerning location and types of 
systems available. A central system serving the Nation will allow 
all users to deal with one information center. Regional systems can 
be more localized, providing more information on specific locales 
within the region. However, because of the nature of these smaller 
systems, additional expense will result from the duplication of 
| and national data will not necessarily be available 

ocally. 





The need for a data storage and retrieval system for coastal and 
marine birds exists. Time and costs necessary to evaluate this important 
resource can be reduced appreciably by having an effective system and 
using reliable data. These results can provide us with better information 
for the conservation of coastal and marine birds. 


The system can be used for other groups of species or forms of 
habitat data without any modification. Programmers can add new features 
by making the necessary modifications in the printouts listed in 
Appendix C. 
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APPENDTX A 
DATA STORAGE AND REYRIEVAL SYSTEMS 


Organizing, storing, retrieving, and rearranging data efficiently 
are important problems in computer science. It is generally recognized 
that a data base management system is needed to effectively deal with any 
complex, multiuser set of data. Martin (1977:22) defines a data base as 
".eea collection of interrelated data stored without harmful or unnecessary 
redundancy to serve multiple applications; the data are stored so that 
they are independent of programs which use the data; a common and 
controlled approach is used in adding new data and in modifying and 
retrieving existing data within the data base. The data are structured 
so as to provide a foundation for future application development." 

Thus, a data base implies much more than simply a collection of data. 


CHARACTERISTICS OF DATA BASES 


Several essential features of a data base described by Martin 
(1977) are: 


(1) Controlled redundancy. Historically, each application kept 
an individual file of data, duplicating data for other applications. 
Multiple updates were required to add or correct data, and some of 
the copies occasionally failed to be updated. Inconsistent answers 
were sometimes obtained because different copies were in different 
stages of revision. Multiple updates and duplicate storage ultimately 
added to the cost of data processing. 


(2) Interconnectedness. A data base can, to a large extent, 
show the relationships among variables. The data are oryanized to 
exploit the relationships among the variables, to answer diverse and 
unanticipated questions, instead of being organized into independent 
application-oriented files which are always used in the same way. 

A data base is not just a collection of unrelated variables but is a 
reservoir of information which is shared by all the applications. 


(3) Data independence. Constant change and growth are character- 
istic of most data bases. It must be possible to restructure a data 
base without difficulty. However, it is important that any restructuring 
not affect existing application programs. In fact,the users do not 
have to know that a change has taken place if they do not need any 
new features of the data base. Without data independence, seemingly 
trivial changes to files may require major programming effort to 
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modify a}1 the programs using the files. The name and description of 
the variables are stored in the data base to achieve data independence. 
As a consequence, the user can refer to the variables by name without 
being concerned about storage methodology in the data base. 


(4) Security protection. Security is critical when personal 
information is stored. There is a similar concern with bird data when, 
for example, the nesting locations of endangered species need to be 
protected from unauthorized access. It is essential to protect the 
integrity of the data by requiring a password to delete, change, or add 
to the data base. Backup copies are also needed in the event of an 
accidental loss of data. 


In addition to these essential features, several other considerations 
are necessary for an effective scientific data base management system 
(Anderson 1977; Martin 1977). 


(5) Retrieval language. A convenient language should be available 
to retrieve any required combination of variables and records. It 
should be possible to construct any set of relationships among 
the variables, even though these relationships were not anticipated 
when the data base was created. The language should be useable by 
biologists who have limited programming experience. 


(6) Report writer. A flexible and easy-to-use report writing 
capability is needed to produce the tables, graphs, maps, and figures. 


(7) Statistical package. A well-designed and well-supported 
Statistical package must be available for data analysis. Sometimes 
a simple table produced by the report writer will be sufficient. At 
other times, a more thorough analysis of the data will be necessary 
in order to draw conclusions or make predictions. When data are 
presented to a statistical package, it is important to include 
variable names and descriptions so that the laborious and error 
prone task of reentering the information is avoided. Statistical 
packages require flat files (two-dimensional tables) and it may be 
necessary to convert more complex file structures to this form. 


(8) Appropriate response time. Many commercial situations such 
as booking airline seats and answering customer inquiries require a 
real-time system to provide quick, random access to file information. 
However, jobs such as processing payrolls are best done by sequentially 
Scanning through files, which accomplishes al] the updating and 
retrieving in one or more passes. This method is useful when fast 
response time is not needed and infrequent file usage makes on-line 
storage unnecessary. Savings are also possible by avoiding the 
cost of maintaining the structures which make random access possible, 
like inverted files for multiple key retrievals. 


One guide for determining the use of direct access or sequential 
methods is the file activity ratio, which is defined as [(number of 
records used in one job)/(number of records scanned in that job)]. 
if the file activity ratio is large, then the job may be done using 
Sequential processing methods in about the same time as using direct 
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access methods. If the file activity ratio is low, then direct 
access methods can skip unwanted records, avoiding lengthy scanning 
Operations. In most scientific situations, the file activity ratio 
is large, and sequential processing methods should be used to avoid 
expensive on-line storage and software maintenance. 


(9) Complex structures. Scientific data are often complex and 
require elaborate data structures for representation. Hierarchical 
data structures are particularly important, but more complex network 
structures may be required. 


(10) Data types. It is necessary for the system to distinguish 
between zero (search conducted but no birds found) and missing data 
(no search conducted). The system should also be able to handle 
integers, real numbers, character strings, dates, and time as 
distinguishable data types. Storage allocation should not waste 
Space by using more storage than necessary for a particular data 
type (e.g., using 8 bytes for a 1-byte integer). The use of mnemonic 
abbreviations instead of code numbers prevents many errors by a 
self-checking process. 


(11) Naming of variables. Flexibility in naming and labeling 
variables is needed as an integral part of the software. Both verbal 
descriptions of variables for the user and descriptions of the variable's 
type and location for the computer should be stored. This procedure 
minimizes confusion which may lead to an incorrect analysis and also 
improves communication among individuals using the data base. 


(12) Editing. A flexible way of easily checking input data 
for errors is required. For example, data should be within a specified 
range or a member of a set of allowable categories. Consistency 
checks among variables should also be possible. If data fail one or 
more of these tests, the user should be notified so that the accuracy 
of the complete data se. can be verified and corrected. 


(13) Updating. A flexible means of adding, deleting, and correcting 
values of variables and whole records is needed. For convenience, it is 
important to have both interactive direct access and batch methods available. 


(14) Evolution. The system must be capable of smoothly evolving to 
meet future demands while remaining compatible with those of the past. 
This is especially true in a scientific environment where information 
from divergent sources must be accommodated and where future needs cannot 
be predicted. Both physical and logical data independence is required 
So that processing programs and user requests are unaffected by changes in 
either the physical storage or the logical organization of the data. 


(15) Minimal cost. Minimal cost to achieve the objectives is 
important. 
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DATA BASE MANAGEMENT SYSTEMS 


Sophisticated commercial data base management systems such as 
SYSTEM 2000, which implement all the essential features of a data 
base (controlled redundancy, interconnectiveness, data independence, 
and security), have been developed. They have excellent retrieval 
languages, report writing capabilities, convenient editing and updating 
facilities, and usually write a data file which can be read by a 
Statistical system. Normally, however, they do not interface to any 
statistical system. None of the variable names and descriptions, 
missing values designations, or other relevant information is transmitted 
with the data. Consequently, many of the advantages of a data base 
management system are lost. These systems generally have no way of 
distinguishing a zero from a missing value. Such systems are able 
to represent complex tree and plex structures and give quick responses 
to inquiries from terminals by using an inverted file structure. 
They do, however, require the use of high cost on-line disk storage 
and the maintenance of complex inverted file structures. These 
Structures make even simple changes to the data base quite costly, 
thereby inhibiting the smooth evolution of the system. 


Another approach is to develop a custom data base management system 
for a particular application. For example, Fox (1977, 1978) has developed 
a clinical information system. A specialized Data Definition Language 

is used to specify the structure of the data and the retrieval. This 
language is compiled into a PL/I program which is link-edited into the 
nucleus of the system. The advantage of such a system is that it can 

be designed to meet all the specified requirements of a particular data 
base. One disadvantage is the high development cost, which cannot be 
shared by other institutions. Kirk and Bragg (1979:315) reported that 
after 2 years of development of a custom data base system in FORTRAN 

and PL/I, “The system was not adopted due to the lack of data management 
and flexible report generating capabilities." When they based the system 
On a general purpose statistical package (SAS), a useful and flexible 
prototype information system was implemented within 3 months. 





Relational data bases based on the mathematical theory of relations 
are one form of data organization (Ingram 1979; Teitel 1977; Martin 1977). 
The most natural way to represent data is with a two-dimensional table 
also known as a flat file, yet the most complex data structures can 
be represented by a series of flat files. 


A relational data base is composed of a series of flat files that 
can be manipulated to incorporate any complex data structure. Their 
advantages are (a) They can be easily understood by users with no 
training in programming. The basic simplicity of a relational data base 
consisting of flat files (tables) avoids the organizational complexity 
associated with tree and plex structures while representing complex logical 
structures. (b) It is easy for the data base to evolve in order to meet 
Changing requirements without the overhead associated with tree and plex 
Structures. New variables are accommodated easily. (c) They permit the 
maximum flexibility in handling unanticipated requests because the logical 
relationships among the variables are not specified until the retrieval 
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request is made. (d) The cost of implementing and maintaining the system 
can be minimal because of the simplicity of a relational data base. 


Originally statistical p-ckages were simply collections of programs 
for doing various statistical analyses. However, some have evolved into 
data base management systems because of the need to manage the scientific 
data required for the statistical analyses. The Statistical Analysis 
System (SAS) (Barr et al. 1979) has the essential features of a data 
base management system: (1) The data files can be shared by all users 
so redundancy can be controlled; (2) SAS supports a relational data 
base so that any required relationships between the variables can be 
represented; (3) variables are referenced by name, and a descripiion 
(both verbal for the user and code for the computer) is stored with the 
data; and (4) separate passwords are available for reading, writing, or 
modifying data thereby meeting the need for data security. 


SAS has most of the other features needed for data base management. 
It has a convenient retrieval language and a flexible report writing 
capacity. A wide range of statistical procedures is available, both 
within SAS and through an interface within the BMDP statistical 
package (Dixon and Brown 1979). SAS supports the required data 
types, including dates, times, and missing value recognition as well 
as the editing and updating functions. Mnemonic abbreviations can 
be used instead of code numbers for identification, and SAS contains 
a PL/I-like programming language which can be used to manipulate the 
data and check input information. Statements are available for 
sorting, merging, and updating files. In addition, it is a general 
purpose statistical and relational data base management system which 
places few restrictions on the evolution of the data base. The cost 
is reasonable because data can be stored less expensively on tape 
than on disks. 


SAS can accommodate any arbitrarily complex data structure by 
Supporting a relational data base. It directly handles. a hierarchical 
Structure by merging levels, two at a time, and associating information 
from the higher level with each record from the lower level. Scientific 
data base management systems such as the Scientific Information Retrieval 
(SIR) system (Anderson et al. 1976, 1978) have been developed to manage 
hierarchical files effectively. Although SIR provides a full range of 
data base management functions, most of them duplicate functions avail- 
able in SAS. On the other hand, SAS can avoid these inefficiencies 
and accomplish a multi-level merge in one pass of the data if additional 
programming statements are used (Ellis 1978). 
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APPENDIX B 
RETRIEVAL LANGUAGE USER'S GUIDE 


The purpose of a retrieval program is to create a SAS data set 
containing selected information available from the data base. A 
MENU listing all the variables currently stored in the data base is 
available to the user on disk (Fig. 5). It contains a description 
of the variables and is used to decide which variables should be 


retrieved. 


After a user has decided which variables he wishes to retrieve, 
he writes a retrieval program. For example, the following retrieval 
program will build and print a data set with the latitude (in degrees), 
longitude (in degrees), and the numbers of Cory's Shearwaters (Puffinus 
diomedea) (COSH) and Northern Fulmars (Fulmarus glacialis) (NOFU) observed. 











// EXEC RETRIEVE 
VARIABLES LATD LONGD COSH NOFU; 
KEEP = '‘YES'; 
//SAS.PROG DD * 
PROC PRINT; . 
Other programs are illustrated in Figures 6 and 7. 





The lines starting with // are Job Control Language (JCL) statements. 
The first JCL statement calls the JCL procedure RETRIEVE which translates 
the retrieval program into SAS statements and then executes them. 
This statement also allows the user to select whether he wants to 
output one record for each observation (// EXEC RETRIEVE ,RECORD=OBSERVATION) 
or one record for each species observed (// EXEC RETRIEVE,RECORD=SPECIES). 
A comparison of Table 6 and Figures 6 and 7 illustrates the distinction. If 
neither is specified, one record for each observation will be output by 
default. The words OBSERVATION and SPECIES may be abbreviated because 
only the first letter is checked. 


The retrieval program is placed between the first and second JCL 
statements. The second JCL statement (//SAS.PROG DD *) indicates 
the end of the retrieval program and the start of the user's SAS 
statements to print, analyze, or both, the retrieved data. A JOB 
card giving the user's accounting information and the JCL instream 
procedure (Appendix C) is required to run a retrieval program. 


In general, a retrieval program consists of a DATA statement, 
a VARIABLE statement, and one or more select statements. Each 
Statement is terminated by a semicolon. A line of the retrieval 
program may be a maximum of 72 characters iong and contain more than 
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222222 

wexeeeKE VARTABLES IN MAIN FILE (NO INITIALIZATION ALLOWED) 

LATD LATITUDE IN DEGREES 

LONGD LONGITUDE IN DEGREES 

weeKKKEE VARTABLES IN SPECIES FILE (NO INITIALIZATION ALLOWED) 

SPECIES FOUR LETTER SPECIES ABBREVIATION 

NUMBER NUMBER OF INDIVIDUALS OBSERVED 

wee keREE VARTABLES IN NUMERIC VARIABLE FILE REFERRING TO MAIN FILE 
DEPTH DEPTH OF WATER IN METERS 

WIND WIND SPEED IN MPH 

weeKKKKK VARTABLES IN CHARACTER VARIABLE FILE REFERRING TO MAIN FILE 
SEA SEA STATE, R=ROUGH, C=CALM 

VESSEL VESSEL TYPE CODE 

*eRREEEE VARIABLES IN NUMERIC VARIABLES FILE REFERRING TO SPECIES FILE 
NADULT NUMBER OF ADULTS 

NIMM NUMBER OF IMMATURES 

weeRKEEX VARTABLES IN CHARACTER VARIABLE FILE REFERRING TO SPECIES FILE 
OIL Y(YES) IF BIRD IS OILED, N(NO) IF BIRD IS NOT OILED 

BEH BEHAVIOR CODE 


Figure 5. A MENU as it would appear on the user's terminal 

giving the names of the variables in the data base and their 
descriptions. Any variables not in the list are assumed to be species 
abbreviations. The numbers on the top line tell the computer how many@ 
variables there are for each group. 
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Table 6. Data in the four files (MAIN, SPECIES, NUMVAR, and CHARVAR) that are used in the examples (Figs. 6 
and 7). The data are arranged so that each line represents a pass through the retrieval program. 























MAIN SPECIES NUMVAR CHARVAR out! 
KEY KEYl= LATD LONGD KEY KEYl= NUMBER KEY KEYl= VAR VALUE KEY KEYl= CVAR  CVALUE 
___ SPECIES ____ SPECIES ___ SPECIES __—s«s§_—s—S_SREC TES 
1 ° 41 68 1  * DEPTH 50 1 : SEA R 
1 * WIND 5 1 VESSEL C 
1 COSH 2 1 COSH  NADULT 1 1  COSH OIL N 
1 COSH NIM 1 1 COSH -BEH F S 
° 1 NOFU 1 1  NOFU OIL N $0 
2° 42 69 2  * — DEPTH 30 2 SEA c 
2 COSH 1 2 COSH OIL Y S 
2 -HEGU 9 $0 
3° 39 68 3. * “DEPTH 40 3 , SEA R 
3. -NOFU 2 $0 
11f RECORD=SPECIES thy” cord will be output at the lines indicated by S, while if RECORD=OBSERVATION a record 


will be output at the | .s indicated by 0. 


























// EXEC RETRIEVE ,RECORD=OBSERVAT ION 

DATA A; 

VAR LATD LONGD DEPTH WIND SEA VESSEL COSH NOFU; 
IF 40<LATD<=45 & 65<LONGD<=70 then KEEP='YES'; 
//SAS.PROG DD * 

PROC PRINT; 


LATD LONGD DEPTH WIND SEA VESSEL COSH NOFU 
41 68 50 5 R C 2 l 
42 69 30 ° C l 0 


Figure 6. Example of a retrieval program with one record for each 
observation, using the MENU given in Figure 5 and the data given in 
Table 6. 
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// EXEC RETRIEVE ,RECORD=SPECIES 
//SAS.OUT DD DSN=SASDATA,DISP=(NEW,KEEP) ,VOL=SER=FILE18,UNIT=DISK, 
// SPACE=(TRK, (10,10) ,RLSE) 


DATA OUT.B; 
VAR LATD LONGD SPECIES NUMBER NADULT NIMM DEPTH SEA OIL (BEH='U'); 


IF SPECIES='COSH' OR SPECIES="NOFU' THEN KEEP=‘YES'; 
//SAS.PROG DD * 
PROC PRINT; 


LATD LONGD SPECIES NUMBER NADULT NIMM _ DEPTH SEA OIL BEH 
41 68 COSH 2 l 1 50 R N F 
41 68 NOFU l . . 50 R N U 
42 69 COSH l ’ : 30 C Y U 
39 68 NOFU 2 : : 40 R ° U 


Figure 7. Example of a retrieval program with one record for each species 
observed, using the MENU given in Figure 5 and the data given in Table 6. 
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One statement. A statement may be continued on the next line if it 
is broken at a point where a blank would normally appear. 


A DATA statement gives a name to the data set that is being 
created by the retrieval program. It has the form "DATA <data set 
name>;" where the <data set name> is any allowable SAS data set name. 
If the DATA statement is omitted, the default data set name is WORK.WORK. 


A VARIABLE statement specifies the variables that are to be included 
in the retrieved data set. It has the form “VARIABLES <list of variables>;" 
where <list of variables> is a list from the MENU separated by blanks. 
The word VARIABLES may be abbreviated VAR. By default these variables 
will be initialized to missing values if they are numeric variables 
or to blanks if they are character variables, if no value exists for 
them in the data base. The user can initialize them to some other 
value by enclosing them in parentheses and specifying the initial 
value using the form "(<variable>=<value>)". For example, one could 
write "VARIABLES LATD LONGD (WIND=0) (SEA='U');" to initialize LATD 
and LONGD to missing values by default, WIND to 0, and SEA to ‘U'. 

The initialization of a variable must be less than 33 characters 

long including the left and right parentheses. Any of the standard 
four-letter species abbreviations (Klimkiewicz and Robbins 1978) 

may also be included in the <list of variables>. Any variables in 
the list which are not in the MENU are assumed to be species abbrevia- 
tions. These variables which list the number of individuals of 

each species are initialized to zero. They are meaningful only when 
one record is output for each observation. No more than 100 variables 
may appear in the VARIABLES statement and, of these, no more than 20 
may be initialized. 


Select statements are any valid SAS program statements (not SAS 
procedures) which set the variable KEEP='YES' if the user wants to include 
the current record in the new data set. Otherwise the record will not be 
selected. Several select statements can be used. All variables refer- 
enced in select statements should be included in the <list of variables> 
to assure that they will be available to the program. 


The example of a retrieval program in Figure 6 outputs one record 
for each observation. A temporary data set named A is created containing 
the variables LATD, LONGD, DEPTH, WIND, SEA, VESSEL, COSH, and NOFU 
from the data base given in Table 6. These variables are described 
in the MENU (Fig. 5) except for COSH and NOFU which are the number 
of Cory's Shearwaters and Northern Fulmars observed. All observations 
in the block from 41° to 45° latitude and 66° to 70° longitude are 
selected. 


The example of a retrieval program in Figure 7 outputs cne record 
for each species observed. A permanent data set named B is created 
in the data set identified on the JCL Data Definition (DD) statement 
with the DDNAME OUT (in the SAS step of the RETRIEVE procedure). This 
data set contains the variables LATD, LONGD, SPECIES, NUMBER, NADULT, 
NIMM, DEPTH, SEA, OIL, and BEH, which have the value U (unknown) if no 
value exists for them in the data base (Table 6). Records for the species 
COSH (Cory's Shearwater) and NOFU (Northern Fulmar) are selected. 
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APPENDIX C 


DESCRIPTION OF THE RETRIEVAL PROCESS 
AND LISTING OF TRANSLATOR PROGRAM 


The RETRIEVE Job Control Language (JCL) instream procedure 
contains two steps (Fig. 9). The first step (EXPAND) executes a 
PL/I program which translates the user's retrieval program into a 
SAS retrieval program which it writes into a temporary file (&&SASIN). 
The symbolic parameter RECORD is passed to the translator program to 
indicate whether a record is to be output for each observation period 
(the default) or for each species observed during that period. The 
MENU file contains a list of variables in the data base which the 
program uses to determine the type of each variable (Fig. 5). 
Users can list this file at a terminal to obtain a description of 
the variables in the data base. The second step executes the SAS 
system, with the SAS retrieval program generated in the first step 
concatenated in front of the user's SAS program to print or analyze 
the retrieved data, or both. There are four data files (MAIN, SPECIES, 
NUMVAR, AND CHARVAR) which must be set up before the retrieval. 
Each file must have the variables KEY and KEY1 and be sorted in 
order of those variables. KEY uniquely identifies the observation, 
while KEY1 uniquely identifies the species. KEY1 is blank if a record 
in one of the files refers to all species observed during a particular 
observation period. 


The general form of the SAS retrieval program generated by the 
user's retrieval program is given in Figure 10. Each of the variables 
listed in the VARIABLES statement is identified to type by a table 
look up based on the MENU (Fig. 5). Any variable not in the table 
is assumed to be a <species abbreviation>. The variable types are 
as follows: <main> are variables from the MAIN file that contain 
variables such as latitude and longitude which apply to all the 
Species observed. Because they are at the first level of the hierarchy, 
these variables are always present and do not need to be initialized. 
<Main numeric> and <main character> are similar variables which 
apply to all species observed but which are only occasionally recorded. 
To save storage space they are stored at the second level of the 
hierarchy in the NUMVAR and CHARVAR files respectively. <Species> 
are variables such as the species observed and the number observed 
which apply to each species observed and which are usually recorded. 
They are stored in the SPECIES file at the second level of the hierarchy, 
so that storage space is not re..: °d for species not observed. 
<Species numeric> and <species cter> variables are similar to 
the <species> variables but they ar: only occasionally recorded. 

They are stored at the third level of the hierarchy in the NUMVAR 
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//RETRIEVE PROC RECORD=OBSERVATION 
//EXPAND EXEC PGM=RL2,PARM='/&RECORD' ,REGION=400K 

//STEPLIB DD DSN=WAL3PHG.DBO215L. PREPROS , VOL=SER=FILE18,UNIT=FILE, 
//  DISP=SHR 

//SYSPRINT DD SYSOUT=A 

//IN DD DDNAME=SYSIN 

//OUT DD DSN=&&SASIN, UNIT=SYSDA, DISP=(NEW,PASS), 

// SPACE=(TRK, (50,50) ,RLSE) ,DCB=(RECFM=FB,LRECL=80 ,BLKSIZE=3120) 
//MENU DD DSN=WAL3PHG.DB0215D.EXMENU, VOL=SER=FILE18,UNIT=FILE ,DISP=OLD 
//SAS EXEC PGM=SAS,PARM='S=72 LS=131 NOCENTER NOOVP' ,REGION=300k 
//LIBRARY DD UNIT=SYSDA,DSN=&&LIBRARY ,SPACE=(TRK, (30, ,30)) 
//STEPLIB DD UNIT=FILE ,DISP=SHR , VOL=SER=MSTATI, 

// DSN=ZABCRUN.SAS792.L IBRARY 

// DD DSN=SYS1.PLIBASE ,DISP=SHR 

/ /WORK DD UNIT=SYSDA,SPACE=(CYL,(20,10)) TEMPORARY DATA SETS 
//FT11F001 DD SYSOUT=A LOG 

//FT12F001 DD SYSOUT=A PRINT 

//FT13F001 DD SYSOUT=B PUNCH 

//FT15F001 DD UNIT=SYSDA,SPACE=(CYL,(1,1)), PARMCARDS 

/ DCB=(RECFM=FB, LRECL=80 ,BLKSI ZE=800,BUFNO=1 ) 
//SORTMSGS DD SYSOUT=A 

//SORTLIB DD DSN=SYS1.SORTLIB,DISP=SHR 

//SORTWKO1 DD UNIT=SYSDA,SPACE=(CYL,(10) ,, CONTIG) 

//SORTWKO2 DD UNIT=SYSDA,SPACE=(CYL, (10), ,CONTIG) 

//SORTWKO3 DD UNIT=SYSDA, SPACE=(CYL, (10), ,CONTIG) 

//MAIN — DD DSN=WAL3DKD.MANMET78.MAIN1,VOL=SER=FILE45, 

// _~UNIT=FILE,DISP=SHR 

//SPECIES DD DSN=WAL3DKD.MANMET78. SPECIES1 ,VOL=SER=FILE45, 

// _~UNIT=FILE,DISP=SHR 

//NUMVAR DD DSN=WAL3DKD.MANMET78.NUMVAR1,VOL=SER=FILE45, 

// _~UNIT=FILE ,DISP=SHR 

//CHARVAR DD DSN=WAL3DKD.MANMET78. CHARVAR1 , VOL=SER=FILE45, 

// _~UNIT=FILE,DISP=SHR 

//SYSIN DD DSN=&&SASIN,DISP=(OLD,DELETE) 

// DD DDNAME=PROG 

// PEND 


Figure 8. Job Control Language instream procedure for retrieving data 
and executing SAS for analysis. 
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DATA <data set name> (KEEP=<main> <main numeric> <main character> <species> 
<species numeric> <species character> <species abbreviation>); 
MERGE MAIN.MAIN1 SPECIES.SPECIES1 ‘NUMVAR.NUMVAR1 CHARVAR.CHARVAR1; 
BY KEY KEY1; 
RETAIN KEEP <main> <main numeric> <main character> <species> <species numeric> 
<species character> <species abbreviation>; 
IF FIRST.KEY THEN DO; 
[KEEP='NO';*IF RECORD=OBSERVATION; ] 
<main> = T||<main>; 
<species abbreviation> = 0; 
<main numeric> = .; *OR=USER SPECIFIED VALUE; 
<main character> = ' '; *OR=USER SPECIFIED VALUE; 
END; 
IF FIRST.KEY1 & KEY1 NOT=' ‘ THEN DO; 
[KEEP='NO';*IF RECORD=SPECIES; ] 
SPECIES=KEY1; 
<species numeric> = .; *OR=USER SPECIFIED VALUE; 
<species character> = ' '; *OR=USER SPECIFIED VALUE; 
END; 
IF VAR='<main numeric>'THEN <main numeric>=VALUE ;ELSE 
IF VAR='<species numeric>'THEN <species numeric>=VALUE ;ELSE; 
IF CVAR='<main character>'THEN <main character>=CVALUE ;ELSE 
IF CVAR='<species character>'THEN <species character>=CVALUE ;ELSE ; 
IF SPECIES='<species abbreviation>'THEN <species abbreviat ion>=NUMBER ;ELSE; 
<select> 
[IF LAST.KEY & KEEP='YES' THEN OUTPUT; *IF RECORD=OBSERVATION; ] 
[IF LAST.KEY1 & KEY] NOT=' ' & KEEP='YES' THEN OUTPUT: *IF RECORD=SPECIES; ] 





Figure 9. General form of a SAS retrieval program produced from the user's 
retrieval program. Capital letters represent fixed parts of the program, 
while small letters represent variable parts. 
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and CHARVAR files. Note that both <main numeric> and <species numeric> 
variables are stored in the NUMVAR file. They are distinguished by 

KEY1 which is blank if the variable refers to the MAIN file and 

which contains the species abbreviation if the variable refers to the 
SPECIES file. <Main character> and <species character> variables in 

the CHARVAR file are similarly distinguished. <Species abbreviation> 
variables contain the numbers of individuals observed of the specified 
Species when one record is output for each observation. The four-letter 
_pecies abbreviations are used for the names of these variables. 

Unlike other variables, <species abbreviation> variables are initialized 
to zero because,if a species was not observed, then there was zero present. 


The files MAIN, SPECIES, NUMVAR, and CHARVAR are merged by KEY 
(which identifies the observation) and KEY1 (which identifies the 
species). Table 6 illustrates this merge with each line representing 
a pass through the retrieval program resulting from a matching of the 
BY variables (KEY and KEY1). Obviously, one would not want to output 
a record for each line. This means that the variables must be RETAINed 
to keep SAS from automatically initializing them to missing values 
before each line in Table 6. 


Variables need to be initialized at two‘different points in the 
merge. <Main numeric>, <main character>, and <species abbreviation> 
variables pertain to all species in an observation and thus are 
initialized when KEY changes (FIRST.KEY is true). Main variables 
are renamed (<main>=T||<main>;) at this point to retain their values 
when SAS automatically assigns missing values to them in the other 
lines of the merge (Table 6). <Species numeric> and <species character> 
variables pertain to each species observed and thus are initialized 
when either KEY or KEY] changes (FIRST.KEY1 is true). If one record 
is required for each observation, the variable KEEP is initialized 
when KEY changes, but if one record is required for each species, 

KEEP is initialized when either KEY or KEY1 changes. KEEP is used to 
select the records which are to be kept (output). 


In the next section of the program, values are assigned to the 
variables. For exampie, the names of variables stored in the NUMVAR 
file are stored in the variable VAR while their values are stored in 
7 variable VALUE. The value is assigned to a variable in a statement 

ike 


IF VAR='WIND' THEN WIND=VALUE;,. 


These statements are followed by the user's <select> statements, which 
-set the variable KEEP='YES' if the record is to be output. Note 

that variables referenced by the <select> statements must be available 
to the program at the time the <select> statements are executed. 

All variables listed in the VARIABLES statement are available to the 
program at that time along with some other variables. Finally, a 
record is output if required. The lines on which output occurs (if 
KEEP='YES') are marked in Table 6. The SAS retrieval programs 
corresponding to the examples in Figures 6 and 7 are given in Figures 
10 and 11. 
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as 


DATA A (KEEP= LATD LONGD DEPTH WIND SEA VESSEL COSH NOFU ); 
MERGE MAIN.MAIN] SPECIES.SPECIES1 NUMVAR.NUMVAR1 CHARVAR.CHARVARI; 
BY KEY KEY]; 

RETAIN KEEP LATD LONGD DEPTH WIND SEA VESSEL COSH NOFU ; 

IF FIRST.KEY THEN DO; 

KEEP='NO '; 

LATD=TLATD; 

LONGD=TLONGD ; 

COSH=0; 

NOFU=0; 

DEPTH=. ; 

WIND=.; 

SEA=' rs 

VESSEL=' *s 

END; 

IF FIRST.KEY1 & KEY1 NOT=' ' THEN DO; 

SPECIES=KEY1; 

END; 

IF VAR="DEPTH' THEN DEPTH=VALUE; ELSE 

IF VAR='WIND' THEN WIND=VALUE; ELSE 


IF CVAR='SEA' THEN SEA=CVALUE; ELSE 
IF CVAR="VESSEL' THEN VESSEL=CVALUE; ELSE 


IF SPECIES="COSH' THEN COSH=NUMBER; ELSE 
IF SPECIES="NOFU' THEN NOFU=NUMBER; ELSE 


IF 40<ILATD<=45 & 65<LONGD<=70 THEN KEEP='YES'; 
IF LAST.KEY & KEEP="YES' THEN OUTPUT; 


Figure 10. SAS retrieval program generated from the user's retrieval 
program in Figure 6. 
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~~-¢ vaaiag (KEEP= LATD LONGD SPECIES NUMBER DEPTH SEA NADULT NIMM OIL 
BEH ); 

MERGE MAIN.MAIN1 SPECIES.SPECIES] NUMVAR.NUMVAR1 CHARVAR.CHARVAR1; 

BY KEY KEY1; 

RETAIN KEEP LATD LONGD SPECIES NUMBER DEPTH SEA NADULT NIMM OIL BEH ; 
IF FIRST.KEY THEN DO; 

LATD=TLATD; 

LONGD=TLONGD ; 

DEPTH=.; 

SEA=' .. 

END; 

IF FIRST.KEY1 & KEY1 NOT=' * THEN DO; 

KEEP='NO '; 

SPECIES=KEY1; 


BEH='U'; 

END; 

IF VAR='DEPTH' THEN DEPTH=VALUE; ELSE 
IF VAR='"NADULT' THEN NADULT=VALUE; ELSE 
IF VAR='NIMM' THEN NIMM=VALUE; ELSE 


IF CVAR='SEA' THEN SEA=CVALUE; ELSE 
IF CVAR="OIL' THEN OIL=CVALUE; ELSE 
IF CVAR="BEH' THEN BEH=CVALUE; ELSE 


IF SPECIES="COSH' OR SPECIES="NOFU’ THEN KEEP='YES'; 
IF LAST.KEY1 & KEY1 NOT=' ' & KEEP="YES' THEN OUTPUT, 


Figure 11. SAS retrieval program generated from the user's retrieval 
program in Figure 7. 
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List 





ing of PL/I program that translates the user’s retrieval program 


into a SAS retrieval program follows. 


RL2: PROC(RECORD) OPTIONS (MAIN); /*&PHG.DB0215J.PREPROS ON FILE18 */ 


/* 


/* 
/* 
/* 
/* 
/* 
/* 
/* 
/* 
/* 
/* 
/* 
/* 
/* 
/* 
/* 
/* 
/* 
/* 
/* 


DCL (M SIZE,S SIZE,MN SIZE,MC SIZE,SN SIZE,SC_SIZE, 
NO DBVARS,I,J) FIXED BIN(15), 
RECORD CHAR(100) VAR, /*FOR EACH OBSERVATION OR EACH SPECIES*/ 
(VARR_INDEX, INIT VARR_ INDEX) FIXED BIN(15) INIT(1), 
EOF BIT(1) INIT(’0°B), /* EOF=0 MEANS END OF FILE NOT REACHED. */ 
KW3 CHAR(3), 
LINE CHAR(80), 
STMT CHAR(32767) VARYING, 
(A_SEL, BUFF) CHAR (32767) VARYING INIT(’’), 
VARR (100) CHAR(8) VARYING, 
INIT VARR(20) CHAR(30) VARYING, 
A DAT CHAR(17) VARYING INIT(’WORK’), 
(SUBSTR, VERIFY, INDEX,LENGTH,MIN) BUILTIN, 
(MENU, IN, OUT) EXTERNAL FILE, 
SYSPRINT EXTERNAL PRINT FILE; 
ON ENDFILE(IN) EOF="1°B; 


zkteeaeaeRRRRRRRARKRKRKRKRKRARRKRKRRRKRKRRKRRAKR KR K/ 
* / 

*#k* DEFINITIONS *** */ 

M SIZE: THE TOTAL NUMBER OF MAIN VARIABLES. * / 
S SIZE: THE TOTAL NUMBER OF SPECIES VARIABLES. * / 
MN SIZE: THE TOTAL NUMBER OF MAIN NUMERIC VARIABLES. & / 
MC SIZE: THE TOTAL NUMBER OF MAIN CHARACTER VARIABLES. * / 
SN_SIZE: THE TOTAL NUMBER OF SPECIES NUMERIC VARIABLES. * / 
SC _ SIZE: THE TOTAL NUMBER OF SPECIES CHARACTER VARIABLES. * / 
NO_DBVARS: THE TOTAL NUMBER OF VARS STORED IN THE DATA BASE. */ 
BUFF: BUFFER INTO WHICH THE USER’S RETRIEVAL PROGRAM IS READ. */ 
STMT: VARIABLE WHICH IS USED TO PARSE A DATA, VAR, OR SELECT */ 
STATEMENT FROM THE USER’S RETRIEVAL PROGRAM. * / 

EOF: BIT VARIABLE SIGNALING THE END OF THE USER’S PROGRAM. * / 


KW3: VARIABLE USED TO STORE THE FIRST THREE LETTERS OF A STMT. */ 


A DAT: VARIABLE USED TO STORE THE NAME OF THE DATA SET TO BE */ 
CREATED BY THE RETRIEVAL PROGRAM. * / 

VARR: ARRAY OF SIZE 100 WHICH STORES THE VARIABLES GIVEN IN * / 
THE VAR STATEMENT. */ 
INIT_VARR: ARRAY OF SIZE 20 WHICH STORES THE INITIALIZED */ 
VARIABLES GIVEN IN THE VAR STATEMENT. */ 

A SEL: VARIABLE USED TO STORE ALL OF THE SELECT STMTS OF A */ 
RETRIEVAL PROGRAM. * / 

* 

0066000000006 6eRcceecececsnceeesna 


GET FILE(MENU) LIST(M_SIZE,S SIZE,MN SIZE,MC_SIZE,SN SIZE,SC_ SIZE); 

NO_DBVARS = M SIZE + S SIZE + MN | SIZE + MC | SIZE + SN | | SIZE + SC_SIZE; 

PUT FILE (SYSPRINT ) EDIT((49)’* g +’ #7) (A(98),A)3 

PUT FILE (SYSPRINT) EDIT(’*’, *#”) (SKIP, A, X(97),A); 

PUT FILE (SYSPRINT ) EDIT(’*’,’R ETRIEVAL PROGRAM’,’#’) 
(SKIP, A, X(32),A(33),X(32),A); 
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PUT FILE(SYSPRINT) EDIT(’*’ ,”*’ ) (SKIP, A,X(97),A); 
PUT FILE(SYSPRINT) EDIT((49)’* ° ,°*’)(SKIP,A(98),A); 
PUT FILE(SYSPRINT) EDIT (’ *’ ,’*’ ) (SKIP,A,X(97),A); 
DO WHILE (EOF=’0°B); 
DO WHILE (INDEX(BUFF,°’;°)=0 & EOF=’0°B); 
CALL READ LINE; 
END ; 
DO WHILE (INDEX(BUFF,’;°)>0); 
CALL CUT_STMT; 
CALL PARSE STMT; 
END ; 
END; 
PUT FILE(SYSPRINT) EDIT (’ *’ ,’** )(SKIP,A,X(97),A)3 
PUT FILE(SYSPRINT) EDIT((49)°* °,°*’) (SKIP,A(98) ,A); 


jreueaeeeeeeRKRKRKRaeRRRRRHRRKRHRKRERRKREKRRKRRKRKRKR KEE 


/* READ LINE IS A PROC WHICH CONCATENATES THE NEXT 72 CHARS OF A 
/* CARD IMAGE ONTO THE REAR OF BUFF. 


/rekkekekekeeeekreeRRRRRHRRRRKRKRRRARRARKRKRRKR KR 
READ LINE: PROC; 
GET FILE(IN) EDIT(LINE) (A(80)); 
IF EOF="1°B THEN RETURN; 
PUT FILE(SYSPRINT) EDIT (’ *° ,LINE, °*’ )(SKIP, A, X(9),A(80) ,X(8),A); 
LINE=SUBSTR (LINE, 1, 72); 
BUFF=BUFF||° °||LINE; 
END READ LINE; 


jr eeaaekeeeeaeeRaRHRRRREHRRRRRARRHRKRERKREKRKRKRKREK 


/* CUT_STMT IS A PROC WHICH CUTS A STMT OFF THE FRONT OF BUFF. 
* 
g 0000eeeesececeseseceencenseseess 
CUT_STMT: PROC; 
I=INDEX(BUFF,°;°); 
STMT=SUBSTR (BUFF, 1,1); 
BUFF =SUBSTR (BUFF , [+1 ) ; 
END CUT_STMT; 
/* 
PARSE STMT: PROC; 
I=VERIFY(STMT,” “°); /* FIND POSITION OF 1ST NON-BLANK CHAR 
STMT=SUBSTR(STMT,I): /* CHOP OFF LEADING BLANKS. 
KW3=S UBSTR (STMT, 1, 3); 
SELECT (KW 3); 
WHEN (“DAT’) CALL DAT; 
WHEN (“VAR’) CALL VAR; 
OTHERWISE CALL SEL; 
END; /* SELECT */ 
END PARSE_STMT; 
/* 
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*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
* / 
*/ 


*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 


*/ 


*/ 
*/ 


*/ 


sists 


DAT: PROC; /* AT THIS POINT STMT HAS NO LEADING BLANKS. * / 
I=INDEX(STMT,’ °); /* FIND POSITION OF FIRST BLANK. * / 
IF I=0 THEN RETURN; /* IF NO BLANKS, THEN DEFAULT DSN=’WORK’. */ 
STMT=SUBSTR(STMT,1I); /* CHOP OFF STATEMENT KEYWORD. */ 
I=VERIFY(STMT,’ °); /* FIND THE FIRST NON-BLANK CHARACTER. */ 
STMT=SUBSTR(STMT,1I): /* CHOP OFF LEADING BLANKS. */ 
I=INDEX(STMT,’;°); /* FIND POSITION OF °;’ IN STMT. * / 
IF I=] THEN RETURN; /* IF °;’ IN FIRST POSITION THEN DSN=’WORK’ .*/ 
A DAT=SUBSTR(STMT,1,I-1): /* THE DSNAME IS STORED IN A DAT. * / 

END DAT; 

/* * / 

SEL: PROC; 

A _SEL=A_SEL||STMT; /* A_SEL STORES ALL THE SELECT STATEMENTS. * / 

END SEL; /* THE SELECT STMTS ARE CONCATENATED TOGETHER. */ 

/* * / 

VAR: PROC; /* AT THIS POINT THERE ARE NO LEADING BLANKS. * / 

IF INDEX(STMT,’ (’)>0O THEN ‘ 


I=MIN (INDEX(STMT,’ °),INDEX(STMT,’(’)); /* FIND THE NEXT CHAR * / 
ELSE I=INDEX(STMT,’ ’); 
STMT=SUBSTR(STMT,I); /* WHICH IS A BLANK OR A °(’ AND CHOP OFF */ 
I=VERIFY(STMT,’ °); /* KEYWORD VAR... AND * / 
STMT=SUBSTR (STMT, 1); /* CHOP OFF ANY LEADING BLANKS. */ 
DO WHILE (SUBSTR(STMT,1,1)“=’;°); /* DO WHILE IST CHAR “= °;”. * / 
IF SUBSTR(STMT,1,1)=’(’ THEN CALL GET INIT VAR; 
ELSE CALL GET VAR; 
END;/* DO WHILE */ 
END VAR; 
/* * / 
GET VAR: PROC; /* AT THIS POINT THE IST CHAR IS NOTA °(’ OR ° ”. */ 
IF INDEX(STMT,’(’)>0 THEN 
I=MIN (INDEX(STMT,°’ °),INDEX(STMT,°(*’)); /* FIND THE NEAREST °(’ */ 
ELSE I=INDEX(STMT,’ °); 
IF I1=0 THEN I=INDEX(STMT,’:”); 
VARR (VARR_INDEX)=SUBSTR(STMT,1,I-1); /* OR ° ° AND STORE THE VAR.*/ 
VARR_INDE2=VARR_INDEX+1; 


STMT=SUBSTR(STMT,1); /* CHOP OFF THE VAR. * / 
I=VERIFY(STMT,’ °); 
STMT=SUBSTR(STMT,1I); /* CHOP OFF ANY LEADING BLANKS. * / 
END GET VAR; 
/* * / 
GET INIT VAR: PROC; /* AT THIS POINT THE IST CHAR IS A ’(’. * / 
STMT=SUBSTR (STMT, 2); /* CHOP OFF LEADING ’(’. * / 
I=VERIFY(STMT,’ °); 
STMT=SUBSTR(STMT,1); /* CHOP OFF LEADING BLANKS. * / 
I=INDEX(STMT,’)’); /* LOCATE THE TRAILING ’)’. * / 
INIT_VARR (INIT _VARR_INDEX)=SUBSTR(STMT,1,I-1); /* STORE VAR W/O */ 
INIT_VARR_INDEX@INIT VARR_INDEX+1; /* THE PARENTHESES. */ 
STMT=SUBSTR(STMT,I+1): /* CHOP OFF VAR. * / 
I=VERIFY(STMT,’ °); 
STMT=SUBSTR(STMT,1I); /* CHOP OFF ANY LEADING BLANKS. */ 


END GET_INIT_ VAR; 
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/* 


/* 
/* 
/* 
/* 
/* 
/* 
/* 
/* 
/* 
/* 
/* 
/* 
/* 
/* 
/* 
/* 
/* 
/* 


* / 
BEGIN; 
DCL A MAI(M SIZE) CHAR(8) VARYING INIT((M SIZE)(’’)), 
A SPE(S SIZE) CHAR(8) VARYING INIT((S SIZE)(‘’)), 
A MNU(MN SIZE) CHAR(8) VARYING INIT((MN SIZE)(’’)), 
A MCH(MC SIZE) CHAR(8) VARYING INIT((MC SIZE)(’’)), 
A SNU(SN SIZE+20) CHAR(8) VARYING INIT((SN SIZE+20)(‘’)), 
A SCH(SC SIZE+20) CHAR(8) VARYING INIT((SC SIZE+20)(’’)), 
A RET(100) CHAR(8) VARYING INI'T((100)(’’)), 
INIT MNU(MN SIZE) CHAR(30) VARYING iNIT((MN SIZE)(‘’)), 
INIT MCH(MC SIZE) CHAR(30) VARYING INiT((MC SIZE)(‘’)), 
INIT SNU(SN SIZE+20) CHAR(30) VARYING INIT((SN SIZE+20)(’’)), 
INI? SCH(SC_SIZE+20) CHAR(30) VARYING INIT((SC SIZE+20)(‘’)), 
(VARX,TYPE) CHAR(8) VARYING, 
(M_INDEX,S INDEX,MN INDEX,MC_INDEX,SN INDEX,SC INDEX, 
R_INDEX, INIT MN INDEX,INIT MC_INDEX,INIT SN INDEX, 
INIT SC INDEX) FIXED BIN(15) INIT(1), 
SASLINE CHAR(72) VARYING INIT(’’), 
DBVARS TABLE(NO DBVARS,2) CHAR(8) VARYING; 


AkRKRKAARRAKHRAAKRKAR RRA * / 
e/ 

xkk DEFINITIONS *** */ 

* / 

A MAI: ARRAY USED TO STORE THE MAIN VARIABLES. * / 
A SPE: ARRAY USED TO STORE THE SPECIES VARIABLES. */ 
A MNU: ARRAY USED TO STORE THE MAIN NUMERIC VARIABLES. */ 
A MCH: ARRAY USED TO STORE THE MAIN CHARACTER VARIABLES. * / 
A_SNU: ARRAY USED TO STORE THE SPECIES NUMERIC VARIABLES. * / 


A SCH: ARRAY USED TO STORE THE SPECIES CHARACTER VARIABLES. * / 
A RET: ARRAY USED TO STORE THE RETRIEVED SPECIES VARIABLES. * / 
INIT MNU: ARRAY TO STORE THE INITIALIZED MAIN NUMERIC VARS. * / 
INIT MCH: ARRAY TO STORE THE INITIALIZED MAIN CHARACTER VARS. */ 
INIT _SNU: ARRAY TO STORE THE INITIALIZED SPECIES NUMERIC VARS. */ 
INIT SCH: ARRAY TO STORE THE INITIALIZED SPECIES CHAR VARS. * / 
DBVARS TABLE: TABLE ARRAY USED TO STORE AND LOOK-UP VARIABLES. */ 


kaeaaeRKRKRKREKEKREKRKRKRKEKRKRKREKRKEKRKREKRRAEKREKRRREKEKRKRKR EK K/ 


GET FILE(MENU) SKIP(1); 

DO I=1 TO M SIZE; 
GET FILE(MENU) SKIP EDIT(DBVARS TABLE(I,1))(A(8)); 
DBVARS TABIE(I,2)='M’; 

END; 

GET FILE(MENU) SKIP(1); 

DO I=M_SIZE+1 TO M SIZE+S SIZE; 
GET FILE(MENU) SKIP Eore(oovans TABLE(I,1))(A(8)); 
DBVARS TABLE(I,2)=’S 

END; 

GET FILE(MENU) SKIP(1); 

DO J=I TO I+MN SIZE-1; 
GET FILE(MENU ) SKIP EDIT(DBVARS TABLE(J,1))(A(8)); 
DBVARS TABLE(J,2)=’ MN’ ; 

END; 
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GET FILE(MENU) SKIP(1); 
DO I=J TO J+MC_SIZE-1; 
GET FILE(MENU ) SKIP EDIT(DBVARS TABLE(I,1))(A(8)); 
DBVARS TABLE(I,2)=’MC’ ; 
END ; 
GET FILE(MENU) SKIP(1); 
DO J=I TO I+SN_SIZE-1; 
GET FILE(MENU) SKIP EDIT(DBVARS TABLE(J,1))(A(8)); 
DBVARS TABLE(J,2)="SN’ ; 
END; 
GET FILE(MENU) SKIP(1); 
DO I=J TO J+SC_SIZE-1; 
GET FILE(MENU ) SKIP EDIT(DBVARS TABLE(I,1))(A(8)); 
DBVARS TABLE(I,2)=’SC’; 
END; 
DO I=] TO VARR_INDEX-1; 
TYPE=FIND(VARR(I)); 
SELECT( TYPE); 
WHEN(‘’M’) DO; 
A MAI(M_INDEX)=VARR(I); 
M_INDEX=M_INDEX+1; 
END; 
WHEN(’S’) DO; 
A SPE(S_INDEX)=VARR(I); 
S _INDEX=S_INDEX+1; 
END; 
WHEN(’ MN’) DO; 
A _MNU (MN | INDEX)=VARR(I); 
MN_INDEX=MN_INDEX+1; 


INIT MNU( INIT MN INDEX)=VARR(I)||’=.”; 


INIT MN _INDEX=INIT MN INDEX+1; 
END; 

WHEN(’MC’) DO; 
A MCH(MC_INDEX)=VARR(I); 
MC_INDEX=MC_INDEX+1; 
INIT _MCH(INIT MC_INDEX)=VARR(I) || 


INIT MC _INDEX=INIT MC INDEX+1; 
END; 

WHEN(’SN’) DO; 
A . SNU(SN | INDEX)=VARR(I); 
SN_ INDEX=SN _ INDEX+1 ; 
INIT | SNUCINIT | SN INDEX)=VARR(I) | 1’ =. 
INIT SN INDEX=INIT SN_INDEX+1; 
END; 

WHEN(’SC’) DO; 
A  SCH(SC , INDEX)=VARR(I); 
SC | INDEX=SC  INDEX+1 ; 
INIT '_SCHCINIT | SC _INDEX)=VARR(T) | 
INIT SC INDEX=INIT SC meRel 
END; we 
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OTHERWISE 


DO; /* IF ‘MISSING’ THEN ASSUMED TO BE */ 


END; /* SELECT */ 


END; /* DO */ 


DO I=] TO INIT VARR_INDEX-1; 


J=INDEX( INIT VARR(I),’ 


A RET(R_INDEX)=VARR(I); /* A SPECIES */ 
R_INDEX=R_INDEX+1; /* NAME. */ 
END; 

=’); /* FIND THE ‘=’ SIGN IN */ 


VARX=SUBSTR(INIT VARR(I),1,J-1); /* INIT VARR. CHOP OFF*/ 


TYPE=F IND(VARX) ; 


SELECT(TYPE) ; 
WHEN( ’ MN” ) 


WHEN( ’ MC’ ) 


WHEN(‘’ SN’ ) 


WHEN( ’ SC“ ) 


WHEN(‘ M’ ) 


WHEN(‘S’ ) 


OTHERWISE 





~/* THE VARIABLE AND STORE IN VARX. 
/* FIND VARX IN THE DB-VAR-TABLE. 


* / 

* / 

DO; 

A _MNU(MN INDEX)=VARX; 

MN INDEX=MN_INDEX+1; 

INIT MNU(INIT MN  INDEX)=INIT | VARR(I); 

INIT MN_INDEX=INIT MN INDEX+1; 

END; 

DO; 

A MCH(MC_INDEX)=VARX; 

MC_INDEX=MC_INDEX+1 ; 

INIT MCH(INIT MC  INDEX)=INIT | VARR(I); 

INIT MC_INDEX=INIT MC_INDEX+1; 

END; 

DO; 

A_SNU(SN_INDEX)=VARX; 

SN_INDEX=SN_INDEX+1 ; 

INIT SNU(INIT_SN | INDEX)=INIT | VARR(1); 

INIT SN_INDEX=INIT_SN_INDEX+1; 

END; 

DO; 

A _ SCH(SC |_INDEX)=VARX; 

SC_INDEX=SC_INDEX+1; 

INIT SCH(INIT SC  INDEX)=INIT  VARR(I); 

INIT SC_INDEX=INIT SC_INDEX+1; 

END; 

DO; 

CALL ERROR; 

A MAI(M_INDEX)=VARX; 

M_INDEX=M_INDEX¢+1 ; 

END; 

DO; 

CALL ERROR1; 

A SPE(S_ INDEX)=VARX; 

S INDEX=S_INDEX+1; 

END; 

DO; /* IF ‘MISSING’ THEN ASSUMED TO BE */ 

IF INDEX(INIT VARR(I),’’’’)=0 THEN 
DO; /* AN INITIALIZED VAR WHICH */ 
A_SNU(SN_INDEX)=VARX; /* WILL */ 
SN_INDEX=SN_INDEX+1; /* APPEAR IN */ 
INIT SNU(INIT SN } INDEX)=INIT | VARR(I); 
-_ SN_INDFX=INIT SN | INDEX+1 ; 

; /® EVERY OBSERVATION OF THE */ 
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ELSE DO; /* OUTPUT DATASET. * / 
A_SCH(SC_INDEX)=VARX; 
SC_INDEX=SC_INDEX+1; . 
INIT SCH(INIT_SC - INDEX)=INIT ' VARR(I); 
INIT SC | INDEX=INIT SC_INDEX+1; 
END; 
END; /* DO */ 
END; /* SELECT */ 
END; /* DO */ 
/* * / 
FIND: PROC(X) RETURNS(CHAR(8) VARYING); 
DCL X CHAR(8) VARYING, 
I FIXED BIN(15); 
DO I=] TO " DBVARS; 
IF X=USVARS TABLE(I,1) THEN RETURN(DBVARS TABLE(I,2)); 
END; 
RETURN( ‘ MISSING’ ); 
END FIND; 
/* * / 
ERROR1: PROC; 
PUT FILE(SYSPRINT) SKIP LIST(’ERROR: °||VARX|| 
* MAY NOT BE INITIALIZED.’ ); 





END ERROR]; 
/* */ 
/* */ 
jreereeeekeeeeekeekeeeekeeeeeeeee eee ee K/ 
/* */ 
/* THE SAS CODE WILL NOW BE GENERATED. */ 
/* * / 
jr ereraeeeeeeeeekekekeRRRHRKRKRERRRHRARKRREAR RAK KEK K/ 
CALL OUT PUT 

(‘reek akRaRRRRRKRKRKRARRRARKRRRRARKRR RR KKH KR), 
CALL OUT PUT 

(‘% *’); 
CALL OUT PUT 

(‘* SAS PROGRAM FOR RETRIEVAL #*’); 
CALL OUT PUT 

(‘* x’); 
CALL OUT PUT 


(‘rR aRRRARRARARRARARARRARKRARRARARRR RR RR KK’); 


RECORD=SUBSTR( RECORD, 1,1); 

IF RECORD “=’0’ & RECORD*=’S’ THEN DO; RECORD=’0’; CALL OUT PUT 
(’*****RECORD PARAMETER UNKNOWN, OBSERVATION ASSUMED;’); END; 

CALL OUT_PUT(’ DATA’ ); 

CALL OUT _PUT((A DAT)); 

CALL OUT PUT(’ (KEEP=" ); 
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I=] TO M_INDEX-1;CALL OUT PUT((A MAI(I)));END; 
I=1 TO S_ , INDEX-1;CALL OUT PUT((A | SPE(I)));END; 
I=] TO MN INDEX-1; CALL OUT ' PUT((A . MNU(I))); END; 
I=1 TO MC_INDEX-1;CALL OUT PUT((A MCH(I)));END; 
I=] TO SN. | INDEX-1;CALL OUT PUT((A | , SNU(L))); END; 
I=1 TO SC | . INDEX-1;CALL OUT PUT((A | . SCH(I))); END; 
I=1 TOR . INDEX-1; CALL OUT  PUT((A . RET(L))); END; 
CALL OUT PUT(’);’); 
CALL OUT PUT(’ MERGE MAIN.MAIN] SPECIES.SPECIES] NUMVAR.NUMVARI’ ); 
CALL OUT PUT(’ CHARVAR. CHARVARI ;’ ); 
CALL OUT PUT(’ BY KEY KEY1;’); 
CALL OUT PUT(’ RETAIN KEEP ’); 
I=] TO M_INDEX-1;CALL OUT PUT((A MAI(I)));END; 
I=1 TO S| . INDEX-1;CALL OUT PUT((A | SPE(1L)));END; 
I=1 TO MN | INDEX-1; CALL OUT PUT((A MNU(I)));END; 
I=1 TO MC _ INDEX-1 ; CALL OUT PUT((A | MCH(1L)));END; 
I=1 TO SN INDEX-1;CALL OUT PUT((A | SNU(I)));END; 
I=] TO SC | | INDEX~1;CALL OUT PUT((A | SCH(1))); END; 
I=] TO R INDEX-1; CALL OUT  PUT((A | RET(I)));END; 
CALL OUT PUT(’;’); 
CALL OUT PUT(’ IF FIRST.KEY THEN DO;’); 
IF RECORD=’0’ THEN CALL OUT PUT(’KEEP=’’NO ’’;’); 
DO I=1 TO M_INDEX-1; 

CALL OUT PUT((A_MAI(I)||°=T’ | JA MAI(I)11°3°)); 
END; 
DO I=1 TO R_INDEX-1; 
CALL OUT PUT((A_RET(I)|1’=0;")); 


S8S3sss8 


SSS8888 


END; 
DO I=1 TO INIT MN INDEX-1;CALL OUT _PUT((INIT_MNU(I)11‘;’));END; 
= I=1 TO INIT | MC - INDEX-1;CALL OUT PUT((INIT | MCH(1)11°;°));END; 
CALL OUT _PUT(’END;’); 
CALL OUT PUT(’IF FIRST.KEY1 & KEY] NOT="’ ‘* THEN DO;’); 
IF RECORD=’S’ THEN CALL OUT _PUT(’ KEEP=’ ‘NO arte Fr 
CALL OUT PUT(’SPECIES=KEY1;”); 
DO I=1 TO INIT SN _INDEX-1;CALL OUT_PUT((INIT_SNU(I)11’;’));END; 
DO I=1 TO INIT SC | _INDEX~1 ; CALL OUT PUT((INIT SCH(1)|1’;’));END; 
CALL OUT PUT(’END;‘); 
DO I=1 TO MN INDEX-1; 
CALL OUT PUT 
((’IF VAR=’’’||A MNU(I)|1’’’ THEN ’||A MNU(I)||’=VALUE; ELSE’ )); 
END; 
DO I=1 TO SN _INDEX-1; 
CALL OUT PUT 
((°IF VAR=’’’||A SNU(I)||°’’ THEN ‘||A SNU(I)||’=VALUE; ELSE’ )); 
END; CALL OUT PUT(’;’); 
DO I=1 TO MC_INDEX-1; 
CALL OUT PUT 
((’IF CVAR=’’’||A_MCH(I)||‘’’ THEN ’||A_MCH(I)||‘=CVALUE; ELSE’)); 
END; 
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DO I=1 TO SC_INDEX-1; 
CALL OUT PUT 
((’ IF CVAR="’"’ ||A SCH(I)1||°*’’ THEN “| |A_SCH(L) || °=CVALUE; ELSE’ )); 
END; CALL OUT PUT(’;’); 
DO I=1 TO R_INDEX-1; 
CALL OUT PUT((‘ IF SPECIES=’“’||A RET(I)|1°’’ THEN “1 1A _RET(I) {I 
’=NUMBER; ELSE’ )); 
END; CALL OUT PUT(’;°); 
DO WHILE(LENGTH(A SEL)>0 & VERIFY(A SEL,’ “)>0); 
IF LENGTH(A SEL)<73 THEN 
DO; 
CALL OUT PUT((A SEL)); 
A SEL=""; 
END; 
ELSE 
DO I=71 TO 1 BY -1; 
IF SUBSTR(A SEL,1I,1)=’ * THEN 
DO; 
CALL OUT PUT((SUBSTR(A SEL,1,1))); 
A SEL=SUBSTR(A SEL, I+1); 
I=1; 
END; 
END; 
END; 
IF RECORD=’S’ 
THEN CALL OUT PUT(’ IF LAST.KEY1 & KEY1 NOT=’ ’ “* & KEEP=’ “YES’’’); 
ELSE CALL OUT PUT(’ IF LAST.KEY & KEEP=’’YES’’”’); 
CALL OUT _PUT(’ THEN OUTPUT; ’ ); 
CALL OUT PUT 
(‘eR RRR RRR RRR RRR RAR RK KR RR KR), 


CALL OUT PUT 


(’* SAS.PROG x’); 
CALL OUT PUT 
(‘sea RRR RRR RRKRARRKRKRKRKRRKRRRKR RK KR KR’): 
/* * / 


OUT PUT: PROC( ADDON); 
DCL ADDON CHAR(72) VARYING; 
IF LENGTH(SASLINE| |ADDON)>71 THEN 
DO; 
PUT FILE(OUT) EDIT(SASLINE)(SKIP,A(72)); 
SASLINE=" “° || ADDON; 
END; 
ELSE SASLINE=SASLINE||’ *’ || ADDON; 
IF INDEX(SASLINE,’;’)>0 THEN 
DO; 
PUT FILE(OUT) EDIT(SASLINE)(SKIP,A(72)); 
SASLINE=" ’ ; 
END; 
END OUT PUT; 
END;/* BEGIN */ 
END RL2; 


U.S. GOVERNMENT PRINTING OFFICE: 1980-672-197/509 
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